Optimization for Hybrid MPI-OpenMP Programs on a Cluster of SMP PCs
نویسندگان
چکیده
This paper applies a Hybrid MPI-OpenMP programming model with a thread-to-thread communication method on a cluster of Dual Intel Xeon Processor SMPs connected by a Gigabit Ethernet network. The experiments include the well-known HPL and CG benchmarks. We also describe optimization techniques to get a high cache hit ratio with the given architecture. As a result, the hybrid model shows performance prominence over the pure MPI model with about 27% for CG and 12% for HPL. Besides, with a relatively small programming effort, we have succeeded in reducing the cache miss ratio and thus significantly risen up performance for the CG benchmark as much as 4.5 times in some cases.
منابع مشابه
Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.
Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...
متن کاملNesting OpenMP in MPI to Implement a Hybrid Communication Method of Parallel Simulated Annealing on a Cluster of SMP Nodes
Concurrent computing can be applied to heuristic methods for combinatorial optimization to shorten computation time, or equivalently, to improve the solution when time is fixed. This paper presents several communication schemes for parallel simulated annealing, focusing on a combination of OpenMP nested in MPI. Strikingly, even though many publications devoted to either intensive or sparse comm...
متن کاملA Hybrid MPI-OpenMP Implementation of an Implicit Finite-Element Code on Parallel Architectures
The hybrid MPI-OpenMP model is a natural parallel programming paradigm for emerging parallel architectures that are based on symmetric multiprocessor (SMP) clusters. This paper presents a hybrid implementation adapted for an implicit finite-element code developed for groundwater transport simulations. The original code was parallel-ized for distributed memory architectures using MPI (Message Pa...
متن کاملAnalysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets
Sequence alignment is an important tool for describing the relationships between DNA sequences. Many sequence alignment algorithms exist, differing in efficiency, in their models of the sequences, and in the relationship between sequences. The focus of this study is to obtain an optimal alignment between two sequences of biological data, particularly DNA sequences. The algorithm is discussed wi...
متن کاملParallel Iterative Solvers for Ill-Conditioned Problems with Reordering
1. Preconditioned Iterative Solvers with Multicoloring In the previous work [1], author developed an efficient parallel iterative solver for finite-element applications on the Earth Simulator (ES) [2] using multi-level hybrid parallel programming model with MPI and OpenMP. The method employs three-level hybrid parallel programming model for SMP cluster architectures, consisting of MPI, OpenMP a...
متن کامل